SEQUENCE LISTING 



<110> Donnelly, John J. 
Liu, Margaret A. 
Shiver, John W. 
Fu, Tong-Ming 



<120> SYNTHETIC HEPATITIS C GENES 



<130> 19732YPCA 



<140> 09/194,949 
<141> 2000-02-17 



<150> PCT/US97/09884 
<151> 1997-06-06 



<150> 60/033,534 
<151> 1996-12-20 



<150> 60/020,494 
<151> 1996-06-11 



<160> 25 



<170> FastSEQ for Windows Version 4.0 



<210> 1 
<211> 3610 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified Vector Sequence 



<400> 1 

gatattggct attggccatt gcatacgttg tatccatatc ataatatgta catttatatt 60 
ggctcatgtc caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa 120 
tcaattacgg ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg 180 
gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 240 
tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 300 
cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 360 
gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 420 
tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 480 
tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 540 
cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 600 
cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat 660 
ataagcagag ctcgtttagt gaaccgtcag atcgcctgga gacgccatcc acgctgtttt 720 
gacctccata gaagacaccg ggaccgatcc agcctccgcg gccgggaacg gtgcattgga 780 
acgcggattc cccgtgccaa gagtgacgta agtaccgcct atagagtcta taggcccacc 840 
cccttggctt cttatgcatg ctatactgtt tttggcttgg ggtctataca cccccgcttc 900 
ctcatgttat aggtgatggt atagcttagc ctataggtgt gggttattga ccattattga 960 
ccactcccct attggtgacg atactttcca ttactaatcc ataacatggc tctttgccac 1020 
aactctcttt attggctata tgccaataca ctgtccttca gagactgaca cggactctgt 1080 
atttttacag gatggggtct catttattat ttacaaattc acatatacaa caccaccgtc 1140 
cccagtgccc gcagttttta ttaaacataa cgtgggatct ccacgcgaat ctcgggtacg 1200 
tgttccggac atgggctctt ctccggtagc ggcggagctt ctacatccga gccctgctcc 1260 
catgcctcca gcgactcatg gtcgctcggc agctccttgc tcctaacagt ggaggccaga 1320 
cttaggcaca gcacgatgcc caccaccacc agtgtgccgc acaaggccgt ggcggtaggg 1380 
tatgtgtctg aaaatgagct cggggagcgg gcttgcaccg ctgacgcatt tggaagactt 1440 
aaggcagcgg cagaagaaga tgcaggcagc tgagttgttg tgttctgata agagtcagag 1500 
gtaactcccg ttgcggtgct gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1560 
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1620 
ggtcttttct gcagtcaccg tccttagatc taggtaccag atatcagaat tcagtcgaca 1680 
gcggccgcga tctgctgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt 1740 
gccttccttg accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat 1800 
tgcatcgcat tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcagcacag 1860 
caagggggag gattgggaag acaatagcag gcatgctggg gatgcggtgg gctctatggg 1920 
tacggccgca gcggccttaa ttaaggccgc agcggccgta cccaggtgct gaagaattga 1980 
cccggttcct cgacccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc 2040 
cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta 2100 
taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg 2160 
ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcaatgc 2220 
tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac 2280 
gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac 2340 
ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg 2400 
aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga 2460 
aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt 2520 
agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag 2580 
cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacgtgatcc 2640 
cgtaatgctc tgccagtgtt acaaccaatt aaccaattct gattagaaaa actcatcgag 2700 
catcaaatga aactgcaatt tattcatatc aggattatca ataccatatt tttgaaaaag 2760 
ccgtttctgt aatgaaggag aaaactcacc gaggcagttc cataggatgg caagatcctg 2820 
gtatcggtct gcgattccga ctcgtccaac atcaatacaa cctattaatt tcccctcgtc 2880 
aaaaataagg ttatcaagtg agaaatcacc atgagtgacg actgaatccg gtgagaatgg 2940 
caaaagctta tgcatttctt tccagacttg ttcaacaggc cagccattac gctcgtcatc 3000 
aaaatcactc gcatcaacca aaccgttatt cattcgtgat tgcgcctgag cgagacgaaa 3060 
tacgcgatcg ctgttaaaag gacaattaca aacaggaatc gaatgcaacc ggcgcaggaa 3120 
cactgccagc gcatcaacaa tattttcacc tgaatcagga tattcttcta atacctggaa 3180 
tgctgttttc ccggggatcg cagtggtgag taaccatgca tcatcaggag tacggataaa 3240 
atgcttgatg gtcggaagag gcataaattc cgtcagccag tttagtctga ccatctcatc 3300 
tgtaacatca ttggcaacgc tacctttgcc atgtttcaga aacaactctg gcgcatcggg 3360 
cttcccatac aatcgataga ttgtcgcacc tgattgcccg acattatcgc gagcccattt 3420 
atacccatat aaatcagcat ccatgttgga atttaatcgc ggcctcgagc aagacgtttc 3480 
ccgttgaata tggctcataa caccccttgt attactgttt atgtaagcag acagttttat 3540 
tgttcatgat gatatatttt tatcttgtgc aatgtaacat cagagatttt gagacacaac 3600 
gtggctttcc 3610 

<210> 2 
<211> 573 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Optimized sequence encoding HCV core antigen 
<400> 2 

atgagcacca accccaagcc ccagaggaag accaagagga acaccaacag gaggccccag 60 
gatgtgaagt tccctggggg aggccagatt gtgggagggg tctacctgct gcccaggagg 120 
ggccccaggc tgggggtgag ggctaccagg aagacctctg agaggtccca gcccaggggc 180 
aggaggcagc ccatccccaa ggccaggagg cctgagggcc gctcctgggc ccagcctggc 240 

tacccctggc ccctgtatgg caatgaaggc tttggctggg ctggctggct gctgtccccc 300 

aggggctcca ggccctcctg gggccccaca gaccccagga ggaggtccag gaacctgggc 360 

aaggtgattg acaccctgac ctgtggcttt gctgacctga tgggctacat ccccctggtg 420 

ggggctcctg tgggaggggt ggctagggct ctggctcatg gggtgagggt gctggaggat 480 

ggggtgaact atgctactgg caacctgcct ggctgctcct tctccatctt cctgctggcc 540 

ctgctctcct gcctgacagt gcctgcttct gcc 573 



<210> 3 

<211> 191 

<212> PRT 

<213> Hepatitis C Virus 



<400> 3 



Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1               5                   10                  15

Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
            20                  25                  30

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala
        35                  40                  45

Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
    50                  55                  60

Ile Pro Lys Ala Arg Arg Pro Glu Gly Arg Ser Trp Ala Gln Pro Gly
65                  70                  75                  80

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Phe Gly Trp Ala Gly Trp
                85                  90                  95

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
            100                 105                 110

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys
        115                 120                 125

Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Val
    130                 135                 140

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp
145                 150                 155                 160

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
                165                 170                 175

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala
            180                 185                 190







<210> 4 
<211> 103 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified Vector Sequence 



<400> 4 

gaattcgctt ccaatgagaa catggagacc atgaaccagc cctaccacat ctgccgcggc 60 
ttcacctgct tcaagaagta aacccgggaa ttctaaagtc gac 103 



<210> 5 
<211> 573 
<212> DNA 

<213> Hepatitis C Virus 

<400> 5 

atgagcacga atcctaaacc tcaaagaaaa accaaacgta acaccaaccg ccgcccacag 60 
gacgtcaagt tcccgggcgg tggtcagatc gttggtggag tttacttgtt gccgcgcagg 120 
ggccccaggt tgggtgtgcg cgcgactagg aagacttccg agcggtcgca acctcgtgga 180 
aggcgacagc ctatccccaa ggctcgccgg cccgagggca ggtcctgggc tcagcccggg 240 
tacccttggc ccctctatgg caatgagggc ttcgggtggg caggatggct cctgtccccc 300 
cgcggctctc ggcctagttg gggccccact gacccccggc gtaggtcgcg caatttgggt 360 
aaggtcatcg ataccctcac gtgcggcttc gccgacctca tggggtacat cccgctcgtc 420 
ggcgcccccg tagggggcgt cgccagggcc ctggcgcatg gcgtcagggt tctggaggac 480 
ggggtgaact atgcaacagg gaatttgccc ggttgctctt tctctatctt cctcctggct 540 
ctgctgtcct gcctgaccgt cccagcttct gct 573 



<210> 6 
<211> 582 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Optimized sequence encoding HCV E1 protein 



<400> 6 

atgtatgagg tgaggaatgt ctctggcgtc taccatgtga ccaatgactg ctccaactcc 60 
tgcattgtct atgaggctgc tgacatgatc atgcacaccc ctggctgtgt gccatgtgtg 120 
agggagggca actcctccag gtgctgggtg gccctgaccc ccaccctggc tgccaggaac 180 
tcctccatcc ccaccaccac catcaggagg catgtggacc tgctggtggg cgctgctgcc 240 
ctgtgctctg ccatgtatgt gggcgacctg tgtggctctg tcttcctggt gtcccagctg 300 
ttcaccttct cccccaggag gtatgagact gtgcaggact gcaactgctc cctgtaccct 360 
ggccatgtct ctggccacag gatggcctgg gacatgatga tgaactggtc ccccaccact 420 
gccctggtgg tctcccagct gctgaggatc ccccaggctg tggtggacat ggtggtgggc 480 
gcccactggg gcgtgctggc tggcctggcc tactactcca tggtgggcaa ctgggccaag 540 
gtgctgattg tgatgctgct gtttgctggc gtggatggct aa 582 



<210> 7 
<211> 193 
<212> PRT 

<213> Hepatitis C Virus 



<400> 7 



Met Tyr Glu Val Arg Asn Val Ser Gly Val Tyr His Val Thr Asn Asp
1               5                   10                  15

Cys Ser Asn Ser Cys Ile Val Tyr Glu Ala Ala Asp Met Ile Met His
            20                  25                  30

Thr Pro Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ser Ser Arg Cys
        35                  40                  45

Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ser Ser Ile Pro
    50                  55                  60

Thr Thr Thr Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala
65                  70                  75                  80

Leu Cys Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu
                85                  90                  95

Val Ser Gln Leu Phe Thr Phe Ser Pro Arg Arg Tyr Glu Thr Val Gln
            100                 105                 110

Asp Cys Asn Cys Ser Leu Tyr Pro Gly His Val Ser Gly His Arg Met
        115                 120                 125

Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val
    130                 135                 140

Ser Gln Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Val Gly
145                 150                 155                 160

Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly
                165                 170                 175

Asn Trp Ala Lys Val Leu Ile Val Met Leu Leu Phe Ala Gly Val Asp
            180                 185                 190

Gly



<210> 8 
<211> 1044 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Optimized sequence encoding HCV E2 protein 



<400> 8 

atgaccacct atgtctctgt gggccatgcc tcccagacca ccaggagggt ggcctccttc 60 
ttctcccctg gctctgccca gaagatccag ctggtgaaca ccaatggctc ctggcacatc 120 
aacaggactg ccctgaattg caacgagtcc atcaacactg gcttctttgc tgccctgttc 180 
tatgtgaaga agttcaactc ctctggctgc tctgagagga tggcctcctg caggcccatt 240 
gacaggtttg cccagggctg gggccccatc acccatgctg agtccaggtc ctctgaccag 300 
aggccatact gctggcacta tgccccccag ccatgtggca ttgtgcctgc cctgcaggtc 360 
tgtggccctg tctactgctt caccccatcc cctgtggtgg tgggcaccac tgacaggttt 420 
ggcgtgccca cctacaactg gggcgacaat gagactgatg tgctgctgct gaacaacacc 480 
aggccccccc agggcaactg gtttggctgc acctggatga actccactgg cttcaccaag 540 
acctgtggcg gccccccatg caacattggc ggcgctggca acaacaccct gacctgcccc 600 
actgactgct tcaggaagca tcctgaggcc acctacacca agtgtggctc tggcccatgg 660 
ctgaccccca ggtgcatggt ggactaccca tacaggctgt ggcactaccc atgcaccttc 720 
aacttcacca tcttcaagat caggatgtat gtgggcggcg tggagcacag gctgaatgct 780 
gcctgcaact ggaccagggg cgagaggtgc aacattgagg acagggacag gtctgagctg 840 
tcccccctgc tgctgtccac cactgagtgg cagatcctgc catgctcctt caccaccctg 900 
cctgccctgt ccactggcct gatccatctg catcagaaca ttgtggatgt gcagtacctg 960 
tacggcgtgg gctccgctgt ggtctccatt gtgatcaagt gggagtatgt gctgctgctg 1020 
ttcctgctgc tggctgatgc ctaa 1044 



<210> 9 
<211> 347 
<212> PRT 

<213> Hepatitis C Virus 



<400> 9 



Met Thr Thr Tyr Val Ser Val Gly His Ala Ser Gln Thr Thr Arg Arg
1               5                   10                  15

Val Ala Ser Phe Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val
            20                  25                  30

Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn
        35                  40                  45

Glu Ser Ile Asn Thr Gly Phe Phe Ala Ala Leu Phe Tyr Val Lys Lys
    50                  55                  60

Phe Asn Ser Ser Gly Cys Ser Glu Arg Met Ala Ser Cys Arg Pro Ile
65                  70                  75                  80

Asp Arg Phe Ala Gln Gly Trp Gly Pro Ile Thr His Ala Glu Ser Arg
                85                  90                  95

Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Gln Pro Cys
            100                 105                 110

Gly Ile Val Pro Ala Leu Gln Val Cys Gly Pro Val Tyr Cys Phe Thr
        115                 120                 125

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr
    130                 135                 140

Tyr Asn Trp Gly Asp Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr
145                 150                 155                 160

Arg Pro Pro Gln Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr
                165                 170                 175

Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala
            180                 185                 190

Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro
        195                 200                 205

Glu Ala Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg
    210                 215                 220

Cys Met Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Phe
225                 230                 235                 240

Asn Phe Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His
                245                 250                 255

Arg Leu Asn Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asn Ile
            260                 265                 270

Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr
        275                 280                 285

Glu Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser
    290                 295                 300

Thr Gly Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu
305                 310                 315                 320

Tyr Gly Val Gly Ser Ala Val Val Ser Ile Val Ile Lys Trp Glu Tyr
                325                 330                 335

Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala
            340                 345



<210> 10 
<211> 1620 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Optimized sequence encoding HCV E1 + E2 proteins 



<400> 10 

atgtatgagg tgaggaatgt ctctggcgtc taccatgtga ccaatgactg ctccaactcc 60 
tgcattgtct atgaggctgc tgacatgatc atgcacaccc ctggctgtgt gccatgtgtg 120 
agggagggca actcctccag gtgctgggtg gccctgaccc ccaccctggc tgccaggaac 180 
tcctccatcc ccaccaccac catcaggagg catgtggacc tgctggtggg cgctgctgcc 240 
ctgtgctctg ccatgtatgt gggcgacctg tgtggctctg tcttcctggt gtcccagctg 300 
ttcaccttct cccccaggag gtatgagact gtgcaggact gcaactgctc cctgtaccct 360 
ggccatgtct ctggccacag gatggcctgg gacatgatga tgaactggtc ccccaccact 420 
gccctggtgg tctcccagct gctgaggatc ccccaggctg tggtggacat ggtggtgggc 480 
gcccactggg gcgtgctggc tggcctggcc tactactcca tggtgggcaa ctgggccaag 540 
gtgctgattg tgatgctgct gtttgctggc gtggatggca ccacctatgt ctctgtgggc 600 
catgcctccc agaccaccag gagggtggcc tccttcttct cccctggctc tgcccagaag 660 
atccagctgg tgaacaccaa tggctcctgg cacatcaaca ggactgccct gaattgcaac 720 
gagtccatca acactggctt ctttgctgcc ctgttctatg tgaagaagtt caactcctct 780 
ggctgctctg agaggatggc ctcctgcagg cccattgaca ggtttgccca gggctggggc 840 
cccatcaccc atgctgagtc caggtcctct gaccagaggc catactgctg gcactatgcc 900 
ccccagccat gtggcattgt gcctgccctg caggtctgtg gccctgtcta ctgcttcacc 960 
ccatcccctg tggtggtggg caccactgac aggtttggcg tgcccaccta caactggggc 1020 
gacaatgaga ctgatgtgct gctgctgaac aacaccaggc ccccccaggg caactggttt 1080 
ggctgcacct ggatgaactc cactggcttc accaagacct gtggcggccc cccatgcaac 1140 
attggcggcg ctggcaacaa caccctgacc tgccccactg actgcttcag gaagcatcct 1200 
gaggccacct acaccaagtg tggctctggc ccatggctga cccccaggtg catggtggac 1260 
tacccataca ggctgtggca ctacccatgc accttcaact tcaccatctt caagatcagg 1320 
atgtatgtgg gcggcgtgga gcacaggctg aatgctgcct gcaactggac caggggcgag 1380 
aggtgcaaca ttgaggacag ggacaggtct gagctgtccc ccctgctgct gtccaccact 1440 
gagtggcaga tcctgccatg ctccttcacc accctgcctg ccctgtccac tggcctgatc 1500 
catctgcatc agaacattgt ggatgtgcag tacctgtacg gcgtgggctc cgctgtggtc 1560 
tccattgtga tcaagtggga gtatgtgctg ctgctgttcc tgctgctggc tgatgcctaa 1620 



<210> 11 
<211> 539 
<212> PRT 

<213> Hepatitis C Virus 



<400> 11 
Met Tyr Glu Val Arg Asn Val Ser Gly Val Tyr His Val Thr Asn Asp
1               5                   10                  15

Cys Ser Asn Ser Cys Ile Val Tyr Glu Ala Ala Asp Met Ile Met His
            20                  25                  30

Thr Pro Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ser Ser Arg Cys
        35                  40                  45

Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ser Ser Ile Pro
    50                  55                  60

Thr Thr Thr Ile Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala
65                  70                  75                  80

Leu Cys Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu
                85                  90                  95

Val Ser Gln Leu Phe Thr Phe Ser Pro Arg Arg Tyr Glu Thr Val Gln
            100                 105                 110

Asp Cys Asn Cys Ser Leu Tyr Pro Gly His Val Ser Gly His Arg Met
        115                 120                 125

Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val
    130                 135                 140

Ser Gln Leu Leu Arg Ile Pro Gln Ala Val Val Asp Met Val Val Gly
145                 150                 155                 160

Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly
                165                 170                 175

Asn Trp Ala Lys Val Leu Ile Val Met Leu Leu Phe Ala Gly Val Asp
            180                 185                 190

Gly Thr Thr Tyr Val Ser Val Gly His Ala Ser Gln Thr Thr Arg Arg
        195                 200                 205

Val Ala Ser Phe Phe Ser Pro Gly Ser Ala Gln Lys Ile Gln Leu Val
    210                 215                 220

Asn Thr Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn
225                 230                 235                 240

Glu Ser Ile Asn Thr Gly Phe Phe Ala Ala Leu Phe Tyr Val Lys Lys
                245                 250                 255

Phe Asn Ser Ser Gly Cys Ser Glu Arg Met Ala Ser Cys Arg Pro Ile
            260                 265                 270

Asp Arg Phe Ala Gln Gly Trp Gly Pro Ile Thr His Ala Glu Ser Arg
        275                 280                 285

Ser Ser Asp Gln Arg Pro Tyr Cys Trp His Tyr Ala Pro Gln Pro Cys
    290                 295                 300

Gly Ile Val Pro Ala Leu Gln Val Cys Gly Pro Val Tyr Cys Phe Thr
305                 310                 315                 320

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr
                325                 330                 335

Tyr Asn Trp Gly Asp Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr
            340                 345                 350

Arg Pro Pro Gln Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr
        355                 360                 365

Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn Ile Gly Gly Ala
    370                 375                 380

Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro
385                 390                 395                 400

Glu Ala Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg
                405                 410                 415

Cys Met Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Phe
            420                 425                 430

Asn Phe Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His
        435                 440                 445

Arg Leu Asn Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asn Ile
    450                 455                 460

Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr
465                 470                 475                 480

Glu Trp Gln Ile Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser
                485                 490                 495

Thr Gly Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln Tyr Leu
            500                 505                 510

Tyr Gly Val Gly Ser Ala Val Val Ser Ile Val Ile Lys Trp Glu Tyr
        515                 520                 525

Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala
    530                 535



<210> 12 

<211> 1350 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Optimized sequence encoding HCV NS5a protein 



<400> 12 

atgtctggct cctggctgag ggatgtctgg gactggatct gcactgtgct gactgacttc 60 
aagacctggc tgcattccaa gctgctgccc aggctgcctg gcgacccatt cttctcctgc 120 
cagaggggct acaggggcgt ctggaggggc gatggcgtga tgcagaccac ctgcccatgt 180 
ggcgcccaga tcactggcca tgtgaagaat ggctccatga ggattgtggg ccccaagacc 240 
tgctccaaca cctggcatgg caccttcccc atcaatgcct acaccactgg cccatgcacc 300 
ccatcccctg cccccaacta ctccagggcc ctgtggaggg tggctgctga ggagtatgtg 360 
gaggtgacca gggtgggcga cttccactat gtgactggca tgaccactga caatgtgaag 420 
tgcccatgcc aggtgcctgc ccctgagttc ttcactgagg tggatggcgt gaggctgcac 480 
aggtatgccc ctgcctgcaa gcccctgctg agggatgagg tgaccttcca ggtgggcctg 540 
aaccagttcc ctgtgggctc ccagctgcca tgtgagcctg agcctgatgt gactgtgctg 600 
acctccatgc tgactgagcc atcccacatc actgctgaga ctgccaagag gaggctggcc 660 
aggggctccc ctccatccct ggcctcctcc tctgcctccc agctgtctgc tccatccctg 720 
aaggccacct gcaccaccag gcatgactcc cctgatgctg acctgattga ggccaacctg 780 
ctgtggaggc aggagatggg cggcaacatc accagggtgg agtctgagaa caaggtggtg 840 
atcctggact cctttgagcc cctgagggct gaggaggatg agagggaggt ctctgtggct 900 
gctgagatcc tgaggaagtc caggaagttc ccccctgccc tgcccatctg ggcgaggcca 960 
tcctacaacc cacccctgct ggagtcctgg aaggaccctg actatgtgcc ccctgtggtg 1020 
catggctgcc ccctgccccc caccatggcc ccacccatcc ccccacccag gaggaagagg 1080 
actgtggtgc tgactgagtc cactgtctcc tctgccctgg ctgagctggc caccaagacc 1140 
ttcggctcct ctggctcctc tgctgtggac tctggcactg ccacggcccc ccctgaccag 1200 
ccatctgatg atggcgacag gggctctgat gatgagtcct actcctccat gccccccctg 1260 
gagggcgagc ctggcgaccc tgacctgtct gatggctcct ggtccactgt ctctgaggag 1320 
gcctctgagg atgtggcctg ctgctcctaa 1350 



<210> 13 
<211> 449 
<212> PRT 

<213> Hepatitis C Virus 



<400> 13 



Met Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp Ile Cys Thr Val
1               5                   10                  15

Leu Thr Asp Phe Lys Thr Trp Leu His Ser Lys Leu Leu Pro Arg Leu
            20                  25                  30

Pro Gly Asp Pro Phe Phe Ser Cys Gln Arg Gly Tyr Arg Gly Val Trp
        35                  40                  45

Arg Gly Asp Gly Val Met Gln Thr Thr Cys Pro Cys Gly Ala Gln Ile
    50                  55                  60

Thr Gly His Val Lys Asn Gly Ser Met Arg Ile Val Gly Pro Lys Thr
65                  70                  75                  80

Cys Ser Asn Thr Trp His Gly Thr Phe Pro Ile Asn Ala Tyr Thr Thr
                85                  90                  95

Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp
            100                 105                 110

Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe
        115                 120                 125

His Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gln
    130                 135                 140

Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His
145                 150                 155                 160

Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Asp Glu Val Thr Phe
                165                 170                 175

Gln Val Gly Leu Asn Gln Phe Pro Val Gly Ser Gln Leu Pro Cys Glu
            180                 185                 190

Pro Glu Pro Asp Val Thr Val Leu Thr Ser Met Leu Thr Glu Pro Ser
        195                 200                 205

His Ile Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro
    210                 215                 220

Pro Ser Leu Ala Ser Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu
225                 230                 235                 240

Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu Ile
                245                 250                 255

Glu Ala Asn Leu Leu Trp Arg Gln Glu Met Gly Gly Asn Ile Thr Arg
            260                 265                 270

Val Glu Ser Glu Asn Lys Val Val Ile Leu Asp Ser Phe Glu Pro Leu
        275                 280                 285

Arg Ala Glu Glu Asp Glu Arg Glu Val Ser Val Ala Ala Glu Ile Leu
    290                 295                 300

Arg Lys Ser Arg Lys Phe Pro Pro Ala Leu Pro Ile Trp Ala Arg Pro
305                 310                 315                 320

Ser Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val
                325                 330                 335

Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Thr Met Ala Pro Pro
            340                 345                 350

Ile Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Thr Glu Ser Thr
        355                 360                 365

Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser
    370                 375                 380

Gly Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Pro Pro Asp Gln
385                 390                 395                 400

Pro Ser Asp Asp Gly Asp Arg Gly Ser Asp Asp Glu Ser Tyr Ser Ser
                405                 410                 415

Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly
            420                 425                 430

Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Ala Cys Cys
        435                 440                 445

Ser



<210> 14 
<211> 1773 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Optimized sequence encoding HCV NS5b protein 



<400> 14 

atgtcctaca cctggactgg cgccctgatc accccatgtg ctgctgagga gtccaagctg 60 
cccatcaacc ccctgtccaa ctccctgctg aggcatcaca acatggtcta tgccaccacc 120 
tccaggtctg ctggcctgag gcagaagaag gtgacctttg acaggctgca tgtgcctgat 180 
gaccactaca gggatgtgct gaaggagatg aaggccaagg cctccactgt gaaggcgaag 240 
ctgctgtctg tggaggaggc ctgcaagctg acccctcccc actctgccag gtccaagttt 300 
ggctatggcg ccaaggatgt gaggaacctg tcctccaagg ctgtgaacca catccactct 360 
gtctggaagg acctgctgga ggacactgag acccccattg acaccaccat catggccaag 420 
aatgaggtct tctgtgtgca gcctgagaag ggcggcagga agcctgccag gctgattgtc 480 
ttccctgagc tgggcgtgag ggtgtgtgag aagatggccc tgtatgatgt ggtctccacc 540 
ctgccccagg ctgtgatggg ctcctcctat ggcttccagt actcccctgg ccagagggtg 600 
gagttcctgg tgaatgcctg gaagtccaag aagaacccca tgggctttgc ctactgcacc 660 
aggtgctttg actccactgt gactgagtct gacatcaggg tggaggagtc catctaccag 720 
tgctgtgacc tggctcctga ggccaggcag gtgatcaggt ccctgactga gaggctgtac 780 
attggcggcc ccctgaccaa ctccaagggc cagaactgtg gctacaggag gtgcagggcc 840 
tctggcgtgc tgaccactaa ctgtggcaac accctgacct gctacctgaa ggcctctgct 900 
gcttgcaggg ctgccaagct gcatgactgc accatgctgg tctgtggcga tgacctggtg 960 
gtgatctgtg agtctgctgg cacccaggag gatgctgcct ccctgagggt cttcactgag 1020 
gccatgacca ggtactctgc cccccctggc gaccctcccc agcctgagta tgacctggag 1080 
ctgatcacct cctgctcctc caatgtctct gtggcccatg atgcctctgg caagagggtc 1140 
tactacctga ccagggaccc caccaccccc ctggccaggg ctgcctggga gactgccagg 1200 
cacacccctg tgaactcctg gctgggcaac atcatcatgt atgcccccac cctgtgggcc 1260 
aggatgatcc tgatgaccca cttcttctcc atcctgctgg cccaggagca gctggagaag 1320 
gccctgggct gccagattta tggcgccacc tacttcattg agcccctgga cctgccccag 1380 
atcatccaga ggctgcatgg cctgtctgcc ttctccctgc actcctactc ccctggcgag 1440 
atcaacaggg tggcctcctg cctgaggaag ctgggcgtgc cccccctgag ggtgtggagg 1500 
cacagggcca ggtctgtgag ggccaagctg ctgtcccagg gcggcagggc tgccacctgt 1560 
ggcaagtacc tgttcaactg ggctgtgagg accaagctga agctgacccc catccctgct 1620 
gcctcccagc tggacctgtc tggctggttt gtggctggct actctggcgg cgacatctac 1680 
cactccctgt ccagggccag gcccaggtgg ttcatgtggt gcctgctgct gctgtctgtg 1740 
ggcgtgggca tctacctgct gcccaacagg tga 1773 



<210> 15 
<211> 590 
<212> PRT 

<213> Hepatitis C Virus 



<400> 15 
Met Ser Tyr Thr Trp Thr Gly Ala Leu Ile Thr Pro Cys Ala Ala Glu
1               5                   10                  15

Glu Ser Lys Leu Pro Ile Asn Pro Leu Ser Asn Ser Leu Leu Arg His
            20                  25                  30

His Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Gly Leu Arg Gln
        35                  40                  45

Lys Lys Val Thr Phe Asp Arg Leu His Val Pro Asp Asp His Tyr Arg
    50                  55                  60

Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr Val Lys Ala Lys
65                  70                  75                  80

Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His Ser Ala
                85                  90                  95

Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu Ser Ser
            100                 105                 110

Lys Ala Val Asn His Ile His Ser Val Trp Lys Asp Leu Leu Glu Asp
        115                 120                 125

Thr Glu Thr Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu Val Phe
    130                 135                 140

Cys Val Gln Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val
145                 150                 155                 160

Phe Pro Glu Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp
                165                 170                 175

Val Val Ser Thr Leu Pro Gln Ala Val Met Gly Ser Ser Tyr Gly Phe
            180                 185                 190

Gln Tyr Ser Pro Gly Gln Arg Val Glu Phe Leu Val Asn Ala Trp Lys
        195                 200                 205

Ser Lys Lys Asn Pro Met Gly Phe Ala Tyr Cys Thr Arg Cys Phe Asp
    210                 215                 220

Ser Thr Val Thr Glu Ser Asp Ile Arg Val Glu Glu Ser Ile Tyr Gln
225                 230                 235                 240

Cys Cys Asp Leu Ala Pro Glu Ala Arg Gln Val Ile Arg Ser Leu Thr
                245                 250                 255

Glu Arg Leu Tyr Ile Gly Gly Pro Leu Thr Asn Ser Lys Gly Gln Asn
            260                 265                 270

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Asn Cys
        275                 280                 285

Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys Arg Ala
    290                 295                 300

Ala Lys Leu His Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val
305                 310                 315                 320

Val Ile Cys Glu Ser Ala Gly Thr Gln Glu Asp Ala Ala Ser Leu Arg
                325                 330                 335




v ci J. 




Thr 


Glu 


Ala 


Met 


Thr 


Ar a 


Tvr 


Ser 


Ala 


Pro 


Pro 


Gly 


Asp 


Pro 








340 










345 










350 






r I U 


m n 


Pro 


Glu 


T v r 
i y i 


Asp 


Leu 


Glu 


Leu 


He 


Thr 


Ser 


Cys 


Ser 


Ser 


Asn 






355 










360 










365 








V d X 




Val 


Ala 


His 


Asp 


Ala 


Ser 


Gly 


Lys 


Arg 


Val 


Tyr 


Tyr 


Leu 


Thr 




^7 0 
^ i \j 










375 










380 










Arg 




P r r\ 

C X \J 


Thr 


Th r 


Pro 


Leu 


Ala 


Arg 


Ala 


Ala 


Trp 


Glu 


Thr 


Ala 


Arg 


JO J 








390 










395 










400 


nlS 


1 1 1 J. 


P r - o 
IT X 


v d x 


As n 


Ser 


Tro 


Leu 


Glv 


Asn 


He 


He 


Met 


Tyr 


Ala 


Pro 










4 0 S 

4 U J 










410 










415 




rp V-, y~ 

1 nr 


Leu 


Trp 


nld 


Arg 


Met 


lie 


Leu 


Met 


Thr 


His 


Phe 


Phe 


Ser 


He 


Leu 






420 










425 










430 






Le u 


rt.X d 


m n 

X i 1 


Glu 


Gin 


Leu 


Glu 


Lys 


Ala 


Leu 


Gly 


Cys 


Gin 


He 


Tyr 


Gly 






435 










440 










445 








i-\l d 




i yr 


Phe 


lie 


Glu 


Pro 


Leu 


Asp 


Leu 


Pro 


Gin 


He 


He 


Gin 


Arg 




4 J U 








455 










460 










L6U 


U -1 c 

nio 


o x y 


Leu 


Ser 


Ala 


Phe 


Ser 


Leu 


His 


Ser 


Tyr 


Ser 


Pro 


Gly 


Glu 


4 £ R 
*4 O D 








47 0 










475 










480 


Tl Q 

lie 


As n 


Arg 


v d x 


Ala 


Ser 


Cys 


Leu 


Arg 


Lys 


Leu 


Gly 


Val 


Pro 


Pro 


Leu 








4 ft S 

*i O -J 










490 










495 




Arg 


Vdl 


Trp 


Arg 


n x o 


Arg 


Ala 


Arg 


Ser 


Val 


Arg 


Ala 


Lvs 


Leu 


Leu 


Ser 






O U U 










505 










510 






PI n 
O X 1 1 


uiy 


uiy 


Arg 


Ala 


Ala 


Thr 


Cys 


Glv 


Lys 


Tyr 


Leu 


Phe 


Asn 


Trp 


Ala 




515 








520 










525 








Val 


Arg 


Thr 


Lys 


Leu 


Lys 


Leu 


Thr 


Pro 


He 


Pro 


Ala 


Ala 


Ser 


Gin 


Leu 




530 










535 










540 










Asp 


Leu 


Ser 


Gly 


Trp 


Phe 


Val 


Ala 


Gly 


Tyr 


Ser 


Gly 


Gly 


Asp 


He 


Tyr 


545 










550 










555 










560 


His 


Ser 


Leu 


Ser 


Arg 


Ala 


Arg 


Pro 


Arg 


Trp 


Phe 


Met 


Trp 


Cys 


Leu 


Leu 










565 










570 










575 




Leu 


Leu 


Ser 


Val 


Gly 


Val 


Gly 


He 


Tyr 


Leu 


Leu 


Pro 


Asn 


Arg 












580 










585 










590 







<210> 16 
<211> 103 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified Vector Sequence 
<400> 16 

cttaagcgaa ggttactctt gtacctctgg tacttggtcg ggatggtgta gacggcgccg
aagtggacga agttcttcat ttgggccctt aagatttcag ctg 

<210> 17 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Modified Vector Sequence
<400> 17 

agatctacca tgagc 

<210> 18 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified Vector Sequence 
<400> 18 

gccgaattcg cttcc 

<210> 19 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified Vector Sequence 
<400> 19 

taaacccggg aattctaaag tcgac 

<210> 20 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified Vector Sequence 

<400> 20 
atcaccatgg at 

<210> 21 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified Vector Sequence 
<400> 21 

gagatcttca tgagc 

<210> 22 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Modified Vector Sequence
<400> 22 

agatccacca tgcag 

<210> 23 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified Vector Sequence 
<400> 23 

ggtgcagatc tgatgagc 

<210> 24 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified Vector Sequence 

<400> 24 
gcctaaagtc gac 

<210> 25 
<211> 4261 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Modified Vector Sequence 






<400> 25 

gatattggct attggccatt gcatacgttg tatccatatc ataatatgta catttatatt 60
ggctcatgtc caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa 120
tcaattacgg ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg 180
gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg 240
tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta 300
cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt 360
gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac 420
tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt 480
tggcagtaca tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac 540
cccattgacg tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt 600
cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat 660
ataagcagag ctcgtttagt gaaccgtcag atcgcctgga gacgccatcc acgctgtttt 720
gacctccata gaagacaccg ggaccgatcc agcctccgcg gccgggaacg gtgcattgga 780
acgcggattc cccgtgccaa gagtgacgta agtaccgcct atagagtcta taggcccacc 840
cccttggctt cttatgcatg ctatactgtt tttggcttgg ggtctataca cccccgcttc 900
ctcatgttat aggtgatggt atagcttagc ctataggtgt gggttattga ccattattga 960
ccactcccct attggtgacg atactttcca ttactaatcc ataacatggc tctttgccac 1020
aactctcttt attggctata tgccaataca ctgtccttca gagactgaca cggactctgt 1080
atttttacag gatggggtct catttattat ttacaaattc acatatacaa caccaccgtc 1140
cccagtgccc gcagttttta ttaaacataa cgtgggatct ccacgcgaat ctcgggtacg 1200
tgttccggac atgggctctt ctccggtagc ggcggagctt ctacatccga gccctgctcc 1260
catgcctcca gcgactcatg gtcgctcggc agctccttgc tcctaacagt ggaggccaga 1320
cttaggcaca gcacgatgcc caccaccacc agtgtgccgc acaaggccgt ggcggtaggg 1380
tatgtgtctg aaaatgagct cggggagcgg gcttgcaccg ctgacgcatt tggaagactt 1440
aaggcagcgg cagaagaaga tgcaggcagc tgagttgttg tgttctgata agagtcagag 1500
gtaactcccg ttgcggtgct gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1560
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1620
ggtcttttct gcagtcaccg tccttagatc taccatgagc accaacccca agccccagag 1680
gaagaccaag aggaacacca acaggaggcc ccaggatgtg aagttccctg ggggaggcca 1740
gattgtggga ggggtctacc tgctgcccag gaggggcccc aggctggggg tgagggctac 1800
caggaagacc tctgagaggt cccagcccag gggcaggagg cagcccatcc ccaaggccag 1860
gaggcctgag ggccgctcct gggcccagcc tggctacccc tggcccctgt atggcaatga 1920
aggctttggc tgggctggct ggctgctgtc ccccaggggc tccaggccct cctggggccc 1980
cacagacccc aggaggaggt ccaggaacct gggcaaggtg attgacaccc tgacctgtgg 2040
ctttgctgac ctgatgggct acatccccct ggtgggggct cctgtgggag gggtggctag 2100
ggctctggct catggggtga gggtgctgga ggatggggtg aactatgcta ctggcaacct 2160
gcctggctgc tccttctcca tcttcctgct ggccctgctc tcctgcctga cagtgcctgc 2220
ttctgccgaa ttcgcttcca atgagaacat ggagaccatg aaccagccct accacatctg 2280
ccgcggcttc acctgcttca agaagtaaac ccgggaattc taaagtcgac agcggccgcg 2340
atctgctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt 2400
gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca 2460
ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcagcaca gcaaggggga 2520
ggattgggaa gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacggccgc 2580
agcggcctta attaaggccg cagcggccgt acccaggtgc tgaagaattg acccggttcc 2640
tcgacccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag 2700
catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac 2760
caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc 2820
ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcaatg ctcacgctgt 2880
aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc 2940
gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga 3000
cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta 3060
ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta 3120
tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga 3180
tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg 3240
cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacgtgatc ccgtaatgct 3300
ctgccagtgt tacaaccaat taaccaattc tgattagaaa aactcatcga gcatcaaatg 3360
aaactgcaat ttattcatat caggattatc aataccatat ttttgaaaaa gccgtttctg 3420
taatgaagga gaaaactcac cgaggcagtt ccataggatg gcaagatcct ggtatcggtc 3480
tgcgattccg actcgtccaa catcaataca acctattaat ttcccctcgt caaaaataag 3540
gttatcaagt gagaaatcac catgagtgac gactgaatcc ggtgagaatg gcaaaagctt 3600
atgcatttct ttccagactt gttcaacagg ccagccatta cgctcgtcat caaaatcact 3660
cgcatcaacc aaaccgttat tcattcgtga ttgcgcctga gcgagacgaa atacgcgatc 3720
gctgttaaaa ggacaattac aaacaggaat cgaatgcaac cggcgcagga acactgccag 3780
cgcatcaaca atattttcac ctgaatcagg atattcttct aatacctgga atgctgtttt 3840
cccggggatc gcagtggtga gtaaccatgc atcatcagga gtacggataa aatgcttgat 3900
ggtcggaaga ggcataaatt ccgtcagcca gtttagtctg accatctcat ctgtaacatc 3960
attggcaacg ctacctttgc catgtttcag aaacaactct ggcgcatcgg gcttcccata 4020
caatcgatag attgtcgcac ctgattgccc gacattatcg cgagcccatt tatacccata 4080
taaatcagca tccatgttgg aatttaatcg cggcctcgag caagacgttt cccgttgaat 4140
atggctcata acaccccttg tattactgtt tatgtaagca gacagtttta ttgttcatga 4200
tgatatattt ttatcttgtg caatgtaaca tcagagattt tgagacacaa cgtggctttc 4260
4261






